make_parallel for network containers #985
Conversation
Thanks for the nice improvement, Wei.
Looks good as is, just small comments.
Thanks,
Le
@@ -171,7 +178,8 @@ def __init__(self,
                  stack_size,
                  pooling_size=1,
                  dtype=torch.float32,
-                 mode='skip'):
+                 mode='skip',
+                 name='TemporalPool'):
Looking at the TemporalPool example above, it seems that if pooling_size == 2, we always ignore every other timestep (half of them).
Have we considered pooling every timestep, instead of pooling every pooling_size timesteps?
Is mode "avg" what you want?
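For illustration, here is a minimal standalone sketch of the two behaviors discussed; the function name and windowing details are assumptions, not ALF's actual TemporalPool implementation. With pooling_size=2, 'skip' keeps only every other timestep, while 'avg' averages each window so every timestep contributes.

import torch

def temporal_pool_sketch(x, pooling_size=2, mode='skip'):
    # x: [num_timesteps, ...]; assumes num_timesteps is divisible by pooling_size.
    if mode == 'skip':
        # Keep only the last timestep of each window; the others are ignored.
        return x[pooling_size - 1::pooling_size]
    elif mode == 'avg':
        # Average each window of pooling_size timesteps; no timestep is discarded.
        windows = x.reshape(-1, pooling_size, *x.shape[1:])
        return windows.mean(dim=1)

x = torch.arange(8, dtype=torch.float32)
print(temporal_pool_sketch(x, 2, 'skip'))  # tensor([1., 3., 5., 7.])
print(temporal_pool_sketch(x, 2, 'avg'))   # tensor([0.5000, 2.5000, 4.5000, 6.5000])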
A parallel network has ``n`` copies of network with the same structure but
different independently initialized parameters. The parallel network can
process a batch of the data with shape [batch_size, n, ...] using ``n``
Should we automatically convert data with shape [batch_size, ...] to [batch_size, n, ...]?
As a network can have sub-networks, doing this check for all of them can be wasteful. So it's intended that the user uses make_parallel_input to do the conversion.
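As a hedged illustration of what such a conversion would do (the helper name below is hypothetical, not ALF's make_parallel_input), replicating an input of shape [batch_size, ...] along a new dim 1 gives every one of the n replicas the same data:

import torch

def replicate_for_parallel(x, n):
    # [batch_size, ...] -> [batch_size, n, ...] by repeating along a new dim 1.
    return x.unsqueeze(1).expand(x.shape[0], n, *x.shape[1:])

x = torch.randn(32, 10)               # [batch_size, num_features]
x_parallel = replicate_for_parallel(x, 5)
print(x_parallel.shape)               # torch.Size([32, 5, 10])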
alf/nest/utils.py
@@ -104,6 +105,10 @@ def _combine_flat(self, tensors):
         else:
             return torch.cat(tensors, dim=self._dim)

+    def make_parallel(self, n):
+        dim = self._dim if self._dim < 0 else self._dim + 1
We should add comments saying that self._dim excludes the batch dim and the parallel dim. The current implementation seems not to ignore the batch dim.
Changed the behavior of NestConcat to not include the batch dim.
Comments added.
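A rough sketch of the dim-shifting logic under the convention agreed above, i.e. that self._dim excludes both the batch dim and the parallel dim; the class and method names are illustrative only, not ALF's NestConcat:

import torch

class ConcatSketch:
    def __init__(self, dim=0):
        # dim is specified relative to a single, unbatched sample.
        self._dim = dim

    def combine(self, tensors):
        # Regular tensors are [batch_size, ...]: skip the leading batch dim.
        dim = self._dim if self._dim < 0 else self._dim + 1
        return torch.cat(tensors, dim=dim)

    def combine_parallel(self, tensors):
        # Parallel tensors are [batch_size, n, ...]: skip batch and parallel dims.
        dim = self._dim if self._dim < 0 else self._dim + 2
        return torch.cat(tensors, dim=dim)

a = torch.randn(4, 3, 8)   # [batch_size, n, num_features]
b = torch.randn(4, 3, 8)
print(ConcatSketch(dim=0).combine_parallel([a, b]).shape)  # torch.Size([4, 3, 16])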
The comments are very helpful. Just some more minor points.
* make_parallel for network containers
* Address comments
* Address further comments
This change supports creating parallel networks for network containers. This is achieved by implementing make_parallel for various layers and transforming all the modules in the container.
With this change, it will be trivial to implement make_parallel for all kinds of networks if they are implemented using network containers.
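The idea can be sketched as follows, using hypothetical classes for illustration rather than ALF's actual containers or layers: a container's make_parallel simply asks each contained module for its parallel version, so any network assembled from containers gets make_parallel essentially for free.

import torch
import torch.nn as nn

class ParallelLinearSketch(nn.Module):
    # n independently parameterized Linear copies; input shape [B, n, in_features].
    def __init__(self, n, in_features, out_features):
        super().__init__()
        self.weight = nn.Parameter(0.05 * torch.randn(n, in_features, out_features))
        self.bias = nn.Parameter(torch.zeros(n, out_features))

    def forward(self, x):  # x: [B, n, in_features]
        return torch.einsum('bni,nio->bno', x, self.weight) + self.bias

class LinearSketch(nn.Linear):
    def make_parallel(self, n):
        return ParallelLinearSketch(n, self.in_features, self.out_features)

class SequentialSketch(nn.Sequential):
    def make_parallel(self, n):
        # The container just transforms every module it holds.
        return SequentialSketch(*(m.make_parallel(n) for m in self))

net = SequentialSketch(LinearSketch(10, 16), LinearSketch(16, 4))
pnet = net.make_parallel(5)
x = torch.randn(32, 5, 10)            # [batch_size, n, ...]
print(pnet(x).shape)                  # torch.Size([32, 5, 4])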